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1 .   Introduct  ion 


Analysts'  forecasts  of  earnings  are  increasingly  used  in 
accounting  and  finance  research  as  expectations  data,  to  proxy 
for  the  unobservable  "market"  expectation  of  a  future 
realization.   Since  a  diverse  set  of  forecasts  is  available  at 
any  time  for  a  given  firm's  earnings,  composites  are  used  to 
distill  the  information  from  the  diverse  set  into  a  single 
expec"tation  .   This  paper  considers  the  relative  merits  of  several 
composite  forecasts  as  expectations  data.   One  of  the  primary 
results  is  that  the  most  current  forecast  available  outperforms 
more  commonly  used  aggregations  such  as  the  mean  or  the  median. 
This  result  is  consistent  with  forecasters  incorporating 
information  from  others'  previous  predictions  into  their  own.   It 
also  suggests  that  the  forecast  date,  which  previous  research  has 
largely  ignored,  is  a  characteristic  relevant  for  distinguishing 
better  forecasts. 

Researchers  have  used  a  variety  of  methods  to  aggregate 
analysts'  forecasts  into  a  single  expectation.   Barefield  and 
Comiskey  (1975),  and  Fried  and  Givoly  (1982)  use  the  mean  of  a 
set  of  forecasts.   Brown  and  Rozeff  (1978)  use  a  single  forecast 
from  ValueLine.   Givoly  and  Lakonishok  (1979)  select  the  "most 
active"  forecaster  for  each  firm  from  those  available  in  Standard 
and  Poors'  Earnings  Forecaster.   Elton,  Gruber.  and  Gultekin 
(1981)  and  Brown,  Foster  and  Noreen  (1984)  consider  both  means 
and  medians  published  in  the  1/B/E/S  Summary  database.   Brown, 


Griffin,  Hagerman  and  Zmijewski  (1985)  combine  a  ValueLlne 
forecast  with  several  time-series  forecasts.   I  consider  three 
composites  from  a  set  of  available  forecasts:   the  mean,  the 
median,  and  the  most  current  forecast. 

The  use  of  predictions  from  univariate  time-series  models  of 
earnings  as  earnings  expectations  has  been  more  common  than  the 
use  of  analysts'  forecasts,  in  part  because  of  data  availability. 
However,  several  studies  (Brown  and  Rozeff  (1978),  Collins  and 
Hopwood  (1980),  Fried  and  Givoly  (1982))  demonstrate  that 
analysts  are  more  accurate  than  univariate  models,  presumably 
because  they  can  incorporate  a  broader  information  set  than  can  a 
univariate  model.   Fried  and  Givoly  also  find  that  analysts' 
forecast  errors  are  more  closely  associated  with  excess  stock 
returns  than  are  those  of  univariate  models.   An  additional 
limitation  of  time-series  models  is  their  substantial  data 
requirements,  which  impart  a  sample  selection  bias  to  the 
research,  toward  longer-lived  and  larger  firms.   Since  analysts' 
forecasts  require  no  parameter  estimation,  sample  selection  bias 
is  less  severe . 

I  add  to  the  literature  comparing  analysts  with  time-series 
models  by  examining  two  quarterly  models  as  benchmarks  for  the 
analysts.   My  results  on  the  relatively  greater  accuracy  of 
analysts  are  consistent  with  those  of  previous  research.   The 
result  that  analysts'  forecasts  more  closely  reflect  the 
information  impounded  in  stock  returns,  however,  is  not 
replicated  here.   Adding  time-series  models  has  the  drawback. 


noted  above,  that  the  sample  size  is  substantially  reduced  by 
their  data  requirements.!   However,  differences  between  my 
results  and  those  reported  elsewhere  justify  the  inclusion  of 
time-series  models  in  the  paper. 

In  section  2,  I  describe  several  proxies  for  the  analysts' 
consensus  and  two  quarterly  time-series  models  used  in  the 
empirical  tests.   In  section  3  I  present  the  methodology  for 
comparing  different  forecasts,  and  in  section  4  describe  the 
data.   The  results  are  discussed- in  section  5,  and  section  6  is  a 
summary  with  some  concluding  remarks. 


2^ P£ox  ies_f  or_Ex2ected_Ear  ni^ngs 


2.1   Defining  Consensus  for  Analysts'  Forecasts 


The  motivations  for  seeking  a  consensus  expectation  when 
many  forecasts  are  available  are  primarily  practical.   First,  if 
individual  forecasts  contain  idiosyncratic  error  which  can  be 
diminished  or,  ideally,  completely  diversified  away,  then  more 
accurate  forecasts  can  be  obtained,  by  combining  forecasts  from 
different  sources  into  a  single  consensus  number. 2   Second,  in 
many  contexts  the  earnings  expectation  is  not  the  central  issue, 
but  is  a  necessary  piece  of  data.   For  example,  to  control  for 
effects  of  simultaneous  earnings  releases  in  measuring  other 
information  events,  a  measure  of  unanticipated  earnings  is 


needed.   To  obtain  unanticipated  earnings,  an  accurate  earnings 
expectation  is  required.   If  expected  earnings  are  measured  with 
error,  as  they  almost  surely  are  when  many  heterogeneous 
forecasts  are  available,  the  power  of  subsequent  tests  is 
reduced . 

In  studies  of  macroeconomic  forecasts,  the  precision- 
weighted  average  has  been  proposed  as  the  "optimal"  combination 
of  forecasts. 3   If  f  is  a  vector  of  forecasts  of  a  single  number 
from  several  sources,  and  if  ^  is  the  variance-covar lance  matrix 
of  contemporaneous  covariances  between  forecast  errors  from 
different  sources,  then  the  precision-weighted  average  f*  is 
defined  : 


f   =  \'  f 


(E~^i)/(lT  ^) 


(2.1 
(2.2 


where  \^  i  s  a  vector  of  ones,  and  X  is  the  vector  of  precision- 
weights  applied  to  forecasts. 

The  precision-weighted  average  defined  in  (2.1)  and  (2.2)  is 
the  optimal  combination  under  quadratic  loss  if  f  comprises 
unbiased  forecasts  for  a  single  series  and  the  precisions  are 
known  (Bates  and  Granger  (1969)).   The  question  remains  open, 
however,  whether  precision-weighted  averages  are  better  than 
alternatives  in  any  empirical  sense.   Several  examples  will 
illustrate  this  point.   First,  if  quadratic  loss  is  not  the 
correct  criterion,  then  f*  is  not  necessarily  optimal.   For 
example,  if  absolute  error  is  the  appropriate  loss  function,  the 


precision-weighted  average  is  inferior  to  the  median  (see  DeGroot 
(1970)  for  a  proof).   Second,  if  the  precisions  are  unknown  and 
must  be  estimated,  the  performance  of  the  precision-weighted 
average  will  depend  on  the  data.   If,  for  example,  covarlances 
between  different  sources  are  not  stable  over  time,  then  the 
weights  cannot  be  estimated  from  existing  data.   Third,  if  a 
cross-section  of  forecasts  is  considered,  rather  than  a  single 
series,  then  the  series-by-series  precision-weighted  averages  are 
inferior  under  quadratic  loss  to  certain  shrinkage  estimators, 
commonly  called  Bayes-Stein  estimators  (see  James  and  Stein 
(1961)). 

For  analyst  forecast  data,  the  second  of  the  above 
considerations,  estimating  Y>     is  of  particular  concern.   To 
estimate  the  covariance  between  two  analysts,  a  time-series  of 
their  predictions  for  a  firm  or  set  of  firms  is  needed.   Such 
consistent  time-series  are  extremely  rare  in  analyst  forecast 
data,  since  the  identities  of  the  individual  analysts  predicting 
a  given  firm's  earnings  varies  considerably  from  year  to  year. 
The  unbalanced  nature  of  the  data  precludes  the  use  of  precision- 
weights  on  a  large  sample  of  firms  or  analysts. 4 

I  consider  the  mean  forecast, ^  an  extreme  simplification  of 
the  precision-weighted  average.   The  theoretical  circumstances 
under  which  the  mean  is  an  optimal  forecast  under  quadratic  loss 
are  fairly  restrictive:   the  forecast  errors  from  different 
sources  must  be  independent,  and  the  forecasts  must  be  of  equal 
precision.   The  empirical  justification  for  the  extreme 


simplification  is  more  compelling.   Given  the  lack  of  consistent 
and  stable  series,  weights  are  likely  to  be  poorly  estimated, 
adding  noise  to  the  composite  formed  from  them. 

A  second  definition  for  the  consensus  examined  here  is  the 
median  of  a  set  of  forecasts.   The  median  is  the  optimal 
combination  under  absolute  error  loss,  and  is  less  sensitive  to 
outliers  than  the  mean  and  related  weighted  averages. 

An  implicit  assumption  behind  the  use  of  either  the  mean  or 
the  median  forecast  to  represent  the  consensus  is  that  all 
forecasts  are  current,  so  that  cross -sec t lona  1  differences  in 
forecasts  are  attributable  to  differential  use  of  the  same  global 
set  of  information.   Gains  from  combining  forecasts  arise  either 
from  the  employment  of  more  information  in  the  aggregate  than  is 
used  by  any  individual,  or  from  diversification  across 
individuals'  idiosyncratic  errors. 

In  fact,  however,  the  analysts'  forecasts  available  at  any 
time  vary  on  how  recently  they  were  produced.   I  refer  to  this 
feature  as  the  age  of  the  forecast.   If  analysts  incorporate 
information  from  others'  forecasts  in  producing  their  own,  then 
more  current  forecasts  may  be  more  accurate,  and  possibly  closer 
to  a  "market"  expectation.   I  cons,ider  the  extreme  application  of 
this  argument,  by  examining  the  most  current  forecast  as  a 
definition  of  consensus. 

The  motivation  for  using  the  most  current  forecast  as  a 
consensus  expectation  addresses  the  context  in  which  forecasts 
are  most  likely  used.   The  implicit  assumption  is  that,  since 


information  is  disseminated  relatively  rapidly,  recent  releases 
have  more  value  than  previous  ones.   The  value  of  an  earnings 
expectation,  however,  depends  upon  how  it  is  used  by  investors 
and  others  in  decisions.   The  mechanics  by  which  earnings 
information  is  translated  into  prices  or  other  outcomes  are  far 
from  well-understood.   The  most  current  forecast  is  used  here 
because  it  is  observable,  and  is  consistent  with  onr 
understanding  of  information  in  capital  markets. 

2.2   Quarterly  Time-series  Models  of  Earnings 

I  use  quarterly  ti««-series  models  of  earnings  as 
benchmarks,  against  which  analysts'  forecasts  are  compared. 
Analysts  have  the  advantage  of  a  much  broader  information  set 
than  is  employed  in  a  univariate  model,  including  industry  and 
firm  sales  and  production  figures,  general  macroeconomic 
information,  and  other  analysts'  forecasts,  in  addition  to  the 
historical  series  of  earnings.   Analysts'  forecasts,  therefore, 
are  likely  to  be  more  accurate  than  forecasts  from  univariate 
mode  1 s . 

An  advantage  of  the  univariate  time-series  approach  is  the 
relative  ease  with  which  earnings  data  can  be  obtained  for 
moderate  samples  of  firms.   This  is  tempered  by  the  caveat  that 
the  data  requirements  of  the  models  impart  a  "survivorship"  bias 
to  samples.   Another  advantage  to  time-series  models  is  the 
relative  simplicity  of  the  models  used  to  generate  expectations. 
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Parsimonious  models  with  a  single,  simple  ARIMA  structure  applied 
to  all  firms  have  been  shown  to  predict  at  least  as  well  as 
univariate  models  with  more  complex,  individually-specified 
structures,  when  one-step-ahead  forecast  errors  are  compared 
cross-sectional ly  .  5 

The  primary  shortcoming  of  univariate  time-series  models  as 
expectations  models  is  their  inability  to  adjust,  between 
earnings  announcements,  for  relevant  new  information.   Models 
using  quarterly  data  to  predict  annual  earnings  represent  an 
improvement  over  models  using  only  annual  data,  by  allowing 
revisions  of  expectations  quarterly,  instead  of  only  annually 
(see  Hopwood,  McKeown,  and  Newbold  (1982)). 

A  second  drawback  to  the  use  of  time-series  models  is  the 
underlying  assumption  of  an  earnings  process  which  is  unchanged 
over  time.   Since  accounting  research  frequently  examines  events 
which  Involve  changes  in  reported  earnings,  representing  changes 
in  underlying  cash  flows,  decisions,  or  the  accounting  reporting 
process,  this  important  assumption  is  frequently  violated. 

I  use  the  following  two  quarterly  time-series  models,  from 
Foster  ( 1977  )  : 


and 


'f^jtq^  =  ^jtq-4  ^  «jO  *  «jl<^jtq-l  "  ^tq-5 


E  r  a    1  =  a       +  e 
^  jtq^      jtq-4    "j2 


(2.3) 


(2.4) 


where  ajtq  denotes  quarterly  reported  earnings  for  firm  j  in 
quarter  q  of  year  t,  and  Sjo.  eji  and  ej2  are  estimated 
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paraaeters.   The  models  are.  respectively,  a  first  order 
autoregressive  process  in  the  fourth  differences  with  a  drift, 
and  a  random  walk  in  the  fourth  differences  with  a  drift.   These 
models  are  chosen  because  of  their  relative  simplicity,  and 
because  they  have  proven  to  be  at  least  as  good  as  other 
mechanical  quarterly  models.   The  data  used  and  estimation  of 
these  models  are  described  in  section  4. 


3^ Me thodol ogx 

3.1   Measuring  Forecast  Errors 

Forecast  errors  are  the  elementary  data  I  use  to  evaluate 
forecasts.   The  forecast  error  is  defined  as  the  difference 
between  actual  earnings  per  share  (EPS)6  of  firm  j  in  year  t  and 
the  forecast  of  EPS  from  source  i,  at  a  horizon  T  prior  to  the 
realization: 


IjtT      jt      ijtx 


(3.1) 


The  source  of  the  forecast,  d^e noted  by  i,  is  one  of  the 
following:  the  mean,  the  median,  or  the  most  current  of  a  set  of 
analysts'  forecasts;  or  one  of  the  two  benchmark  quarterly  models 
described  in  section  2.2.  The  composite  analyst  forecasts  are 
constructed  from  a  set  of  forecasts  which  includes  the  most 
recent  forecast  from  each  individual  in  the  database,  prior  to  a 
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fixed  horizon  date.  Forecast  selection,  the  database  used,  and 
the  forecasting  horizon  are  described  in  more  detail  in  section 
4  . 

In  addition  to  the  raw  forecast  errors  defined  in  equation 
(3.1),  I  present  results  for  two  other  scales:  forecast  errors 
scaled  by  the  average  absolute  deviation  of  the  past  five  years 
changes  in  EPS , 


se 


ijtx 


i  jtT 


t-1 

E 
s  =  t-5 


(3.2) 


AA 


js 


and  forecast  errors  as  a  percent  of  the  absolute  value  of  the 
prior  year's  EPS, 


1  JtT 


(3.3) 


There  are  two  reasons  for  considering  several  scales  for 
forecast  errors  in  this  study.   The  first  is  comparability  with 
previous  work.   The  second  is  that  EPS  forecast  errors  may  be 
heteroscedas t i c  .   That  is,  there  may  be  differences  in 
predictability  of  EPS  across  firms^,  perhaps  related  to  firm  size, 
earnings  variability,  or  available  information. 

Equation  (3.2)  scales  forecast  errors  by  the  average 
variation  in  previous  years'  EPS,  as  a  benchmark  for 
predictability.   The  average  variation  is  measured  here  by  the 
average  absoiute  value  of  past  changes.   Phis  measure  is  less 
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sensitive  to  outliers  than  the  standard  deviation,  and  describes 
the  variablility  of  a  random  walk  with  trend  when  loss  is 
proportional  to  absolute  error.? 

Variants  of  (3.3)  have  been  used  frequently  in  previous 
research  to  create  a  unitless  neasure  (e.g.,  Cragg  and  Malkiel 
(1966),  Brown  and  Rozeff  (1978).  Beaver,  Clarke  and  Nrigbt 
(1979).  Malkiel  and  Cragg  (1981),  and  Elton,  Gruber  and  Gultekin 
(1981)).   Equation  (3.3)  is  appropriate  if  forecasts  of  EPS 
growth,  not  EPS,  are  relevant  for  decisions.   Alternatively,  if 
predictability  is  proportional  to  the  level  of  EPS,  then  equation 
(3.3)  controls  for  cross-sectional  variation  in  predictability. 

3.2   Aggregating  Forecast  Errors 

If  a  forecast  incorporates  all  the  information  available  on 
the  forecast  date  in  an  unbiased  manner,  then  it  can  be  described 
as  an  expectation  in  the  usual  statistical  sense  of  the  word. 
Let  f**  denote  such  a  forecast: 


fjtx  =  ^t^■tl^T  1    '  ^^'^ 


where  (tt^^j    represents  the  informati\on  available  at  a  horizon  x 
prior  to  the  realization,  and  E[  |  ]  is  the  conditional 
expectation  operator. 

In  the  span  of  time  between  the  forecast  date  and  the 
realization  date,  new  information  may  arrive.   Even  a  forecast 
like  (3.4),  which  may  be  ideal  in  the  sense  of  employing  all 
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information  available  on  the  forecast  date,  will  omit 
unanticipated  information  which  arrives  later.   Forecast  errors 
consist,  in  part  or  entirely,  of  new  information  revealed  over 
the  forecast  horizon,  i.e.  between  forecast  and  realization. 

There  are  two  closely-related  implications  of  new 
information  being  impounded  in  forecast  errors.   First, 
contemporaneous  forecast  errors  will  not,  in  general,  be  cross- 
sectionally  independent  observations.   Second,  forecast  errors 
within  a  year  which  are  aggregated  cross-sec t iona 1 ly  may  appear 
to  be  "biased"  because  of  the  common  new  information  reflected  in 
them  . 

Events  such  as  unanticipated  macroeconomic  shocks  may  affect 
many  firms  in  a  similar  manner,  inducing  a  correlation  in 
forecast  errors  for  different  firms  for  the  same  year  and 
horizon.   Moreover,  since  different  forecast  horizons  within  a 
year  overlap,  errors  are  correlated  across  horizons  within  a 
year,  in  general. 

If  forecast  errors  are  positively  correlated  across  firms 
within  years,  then  statistical  comparisons  based  on  data  from  a 
short  time-series  which  assume  that  observations  are  cross- 
sectionally  independent  will  over s^t ate  the  statistical  validity 
of  the  results.  Several  studies  (Brown  and  Rozeff  (1978),  Elton, 
Gruber  and  Gultekin  (1981),  Malkiel  and  Cragg  (1980).  for 
example)  have  compared  forecasts  using  criteria  such  as  the 
number  or  proportion  of  times  that  one  forecasting  method 
outpredicts  another.   This  criterion,  or  any  other  which  assumes 
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A  simple  model  of  time-period  effects  in  forecast  errors 


is  :  9 


"jtT  =  ^T  ^  ^jtT 


(3.6) 


where  Utx  is  the  average  forecast  error  for  year  t  and  horizon  x, 
and  Tij^^  jg  a  random  error  term,  representing  the  deviation  of 
firm  j's  forecast  error  from  the  common  annual  mean. 

I  estimate  (3.6)  using  least  squares  with  a  dummy  variable 
for  each  year. 10   The  test  for  bias  is  based  on  the  grand  mean  of 
the  estimated  annual  averages: 


1    ^  - 
=  -   E  u 

T         tT 

t-1  ^ 


(3.7) 


where  T  is  the  number  of  years  in  the  sample,  and  the  u^i    are  the 
year-specific  average  forecast  errors,  estimated  separately  for 
each  horizon  T.   The  average  u^  defined  in  (3.7)  is  a  linear 
combination  of  least-squares  coefficients,  with  estimated 
standard  error: 


S>    -'    T^     ■      [i'(X'X)~^]^/^ 


(3.8) 


where  i^  i  s  a  vector  of  ones  of  length  T,  X  is  the  matrix  of  dummy 

variables,  and  s^^  js  the  regression  residual  standard  error.   For 
the  model  given  by  equation  (3.6),  equation  (3.8)  reduces  to: 


S- 


T 

[  E 
t-i 


2_ 
J 


1/2 


(3.9) 
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where  J-j-  j^s  the  number  of  observations  in  year  t.   The  standard 
error  for  the  bias  test  is  a  weighted  version  of  the  residual 
standard  error  8^,    constructed  fron  the  tij^t-   The  residuals 
'^jtt  are  deviations  from  the  annual  averages,  and  so  are  purged 
of  this  ti«e-per iod-specif ic  inforBation  which  induces  cross- 
sectional  correlation  in  the  ej:{_^.   The  precision  contributed  by 
each  annual  average  Utx  to  the  precision  of  the  mean  Ux  is 
determined  by  the  overall  standa-rd  error,  Sn.  and  the  number  of 
observations,  J^ ,  in  that  year. 

The  ratio  of  (3.7)  to  (3.9)  is  evaluated  as  a  t-statistic, 
and  is   used  to  measure  forecast  bias  for  each  analyst  coaposite, 
mfeur;,  mediar.  and  most  current,  and  for  the  quarterly  models. 

3.4   Evaluating  Forecasts  -  Accuracy 

I  use  an  approach  similar  to  the  bias  evaluation  described 
in  the  previous  section  for  evaluating  relative  forecast 
accuracy.   Accuracy  is  defined  as  absolute  or  squared  forecast 
error.   The  model  is: 


e  .   1=5     +8     +  t  . 

jtx'       IJT      2tT      jtT 


(3.10) 


or  : 


JtT 


-  Y      +  Y      +  C 

IJT      2tT      jtT 


(3.11) 


wheie  the  Sij^-  and  the  Yijx  measure  average  accuracy  for  each 
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firm  J,  and  the  62tT  and  Y2tT  measure  average  accuracy  for  each 
year  t.   The  tjtT  and  the  CjtT  are  deviations  from  the  average 
accuracy  in  this  sample  for  firm  j  and  for  year  t.   Differences 
in  accuracy  across  firms  could  arise,  for  example,  If  there  are 
persistent  differences  in  the  amount  of  information  available  for 
different  firms.   Differences  in  accuracy  across  years  could 
arise  if  there  are  more,  or  larger,  unanticipated  events  in  some 
years  than  in  others. 

Equations  (3.10)  and  (3.11)  are  estimated  using  least 
squares  on  a  set  of  dummy  variables  for  firms  and  years.   Average 
accuracy  is  computed  in  a  manner  analogous  to  equation  (3.6),  as 
a  linear  combination  of  the  estimated  effects:ll 


^  .?:   '^^^  * 


t  =  i 


2JT 


(3.12) 


or  : 


'    .^. 


IJT 


t  =  l 


2JT 


(3.13) 


Equation  (3.12)  gives  the  average  absolute  error  accuracy,  and 
(3.13)  gives  the  average  squared  error  accuracy.   The  average 
accuracies  in  equations  (3.12)  and^  (3.13)  have  standard  errors 
which  are  estimated,  respectively  as: 


^J 


s^  .  [a)'(Z'Z)  ^o)]^^.^ 


(3.14) 


and  : 


-  ]   1/2 
[W'  (Z'Z)   y]-^' 


(3.15) 
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In  (3.14)  and  (3.15),  Z  represents  the  natrlx  of  dufflmy  variables 
used  to  estimate  equations  (3.10)  and  (3.11),  u)  is  the  vector  of 
weights  that  transforn  the  estlaated  parameters  into  the  averages 
defined  in  (3.12)  and  (3.13).  and  s^  and  Sf  are  the  residual 
standard  errors  from  the  regression  equations. 

The  estimates  in  equations  (3.12)  through  (3.15)  are 
computed  for  the  mean,  median  and  most  current  analyst  forecasts, 
and  for  the  forecasts  from  the  quarterly  models.   Pairwise 
differences  in  accuracy  are  compared  across  forecast  sources 
using  a  t-statistic  constructed  from  the  average  accuracies  from 
(3.12)  or  (3.13)  and  their  standard  errors  from  (3.14)  and 
(3.15). 

3.5   Evaluating  Forecasts  -  Market  Association 


The  criteria  developed  in  the  previous  two  sections,  bias 
and  relative  accuracy,  are  objective  and  reasonably  common  in 
evaluation  of  forecasts.   They  do  not,  however,  address  the 
context  in  which  the  forecasts  are  used.   Both  researchers'  and 
investors'  use  of  forecast  data  iUx  contexts  related  to  securities 
markets  suggests  that  association  with  stock  returns  may  provide 
a  relevant  empirical  comparison. 

In  section  3.2  I  argued  that  forecast  errors  will  Impound 
information  arriving  after  the  forecast  date.   Subject  to  the 
qualifications  discussed  below,  this  implies  that  the  forecast 
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error  will  be  positively  correlated  with  new  information 
impounded  in  stock  returns  over  the  forecast  horizon. 

I  measure  the  new  information  impounded  in  stock  returns  by 
the  cumulated  excess  returns  from  a  market  model.   The  excess 
return  is  the  deviation  of  the  observed  return  Rjg  from  its 
expectation  based  on  the  model: 

E[ln(l  +  R.,)]  =  a.  +  6.1n(l  +  R„^)         (3.16) 

J  S  J        J  MS 

where  R^g  is  the  return  on  a  market  portfolio  of  securities.   The 
data  and  estimation  of  parameters  in  equation  (3.16)  are 
discussed  in  section  4. 

Excess  returns,  not  raw  stock  returns,  are  used  since  excess 
returns  represent  unanticipated  returns.   Inf ormat ional  ly 
efficient  forecasts  will  have  forecast  errors  comprising  only 
unanticipated  information.   Forecasts  which  are  not 
inf ormat ional ly  efficient  will  have  forecast  errors  consisting  of 
both  unanticipated  information  and  information  which  was 
available  at  the  forecast  date  but  was  not  incorporated  into  the 
forecast.   Forecast  errors  from  inefficient  forecasts  may 
therefore  be  correlated  with  the  anticipated  component  of  stock 
returns.   This  correlation  could  cause  a  stronger  association 
with  raw  returns  to  be  observed  for  inefficient  forecasts  than 
for  efficient  forecasts.   This  argues  against  using  the 
association  between  raw  returns  and  forecast  errors  as  a 
criterion  for  evaluating  forecasts,  and  in  favor  of  using  only 
the  unanticipated  portion  of  returns. 

Two  qualifications  to  the  implied  association  between  the 
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information  reflected  in  excess  stock  returns  and  that  reflected 
in  earnings  forecast  errors  are  worth  noting.   First,  the 
information  relevant  to  valuing  the  firm's  common  stock  is  not 
precisely  the  same  as  the  information  relevant  to  current-year 
earnings.   There  are  errors  in  both  variables  with  respect  to  the 
measured  association  between  them  which  represents  the  common 
information.   Non-recurring  events,  whether  they  are  treated  as 
extraordinary  items  or  not,  may  affect  earnings  in  one  particular 
year,  but  may  be  inconsequential  to  the  long-term  value  of  the 
firm.   Conversely,  events  that  influence  longer-term  prospects, 
such  as  changes  in  investment  opportunities,  may  affect  the  value 
of  the  firm  without  altering  current  earnings. 

Second,  excess  returns  are  constructed  to  exclude  one  source 
of  unanticipated  information.   The  excess  return  is  purged  of  its 
systematic  relation  with  market  returns,  which  includes  both 
anticipated  and  the  unanticipated  components.   Elimination  of  the 
anticipated  component  is  desirable,  as  discussed  above. 
Elimination  of  the  unanticipated  portion,  however,  will  reduce 
the  measurable  association  between  excess  returns  and  forecast 
errors.   Correction  of  this  problem  requires  a  model  of  the 
expected  return  on  the  market,  so  that  the  unexpected  component 
can  be  isolated.   However,  while  the  power  of  the  tests  for 
positive  association  is  reduced,  the  reduction  in  power  does  not 
vary  across  forecast  sources.   It  will  not  affect  the  relative 
degree  of  association  across  sources. 

Both  qualifications  noted  above  will  have  the  effect  ol 
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reducing  the  measurable  association  between  forecast  errors  and 
excess  returns.   Nevertheless,  previous  studies  facing  the  same 
inherent  difficulties  have  found  statistically  significant 
positive  associations  between  unexpected  earnings  and  excess 
stock  returns  (Ball  and  Brown  (1968),  Beaver,  Clarke  and  Wright 
(1979),  and  Fried  and  Givoly  (1982).  for  example). 

The  regression  model  used  to  estimate  the  association 
between  cumulated  excess  returns,  represented  by  Uj^-.^   and 

forecasterrors,ejj^^_is: 


e..^   =a,..   +a^.^   +8.   U.^   +v..^ 

IjtT      llJT      2ltT      IT    jtT      IjtT 


(3.17) 


In  (3.17),  B^-fUjtx  is  the  portion  of  the  forecast  error  from 
source  i  at  horizon  T   which  is  systematically  related  to  excess 
returns.   The  slope  coefficient  6i-f  is  the  covariance  between 
excess  returns  and  forecast  errors,  adjusted  for  firm  and  year 
effects,  in  units  of  the  variance  of  excess  returns.   Using 
excess  returns  as  the  independent  variable  and  forecast  errors  as 
dependent  has  the  desirable  feature  that  6  ^ -^  and  its  associated 
t-statistic  have  the  same  scale  for  all  sources  i.   If  the  roles 
of  these  two  variables  were  reversed  in  the  regression  equation, 
the  estimated  regression  slope  coefficient  would  depend 
explicitly  on  the  forecast  error  variance  from  source  i. 

The  constants  a^^j^j^j^  and  a2itT  measure,  respectively,  firm- 
and  year-specific  average  forecast  errors,  conditional  on  the 
systematic  relation  with  excess  returns.   The  ot2itx,  the  year 
effects,  play  an  important  role  in  equation  (3.17),  since  they 
capture  the  t i me- per i od- spec i f i c  information  in  forecast  errors 
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which  is  not  captured  by  excess  returns. 12   Among  other  things, 
they  include  the  average  effect  of  omitting  the  unanticipated 
component  of  the  market  return.   If  the  a2itT  are  not  included  in 
the  model,  then  they  are  impounded  in  the  regression  residuals  as 
an  omitted  variable.   This  induces  cross-sectional  correlation  in 
the  residuals,  which  if  ignored  leads  to  incorrect  statistical 
inferences,  as  was  discussed  above  for  the  bias  computation . 13 

Equation  (3.17)  is  estimated  by  stacking  the  regressions  for 
the  five  forecast  sources,  and  estimating  them  jointly.   The 
forecast  sources  are  the  mean,  median  and  most  current  analyst 
forecast  and  the  two  quarterly  models.   Estimations  are  performed 
jointly  for  the  five  forecast  sources,  and  separately  for  each 
forecast  horizon. 

Since  the  matrix  of  independent  variables,  consisting  of 
cumulated  excess  returns  over  the  forecast  horizon  and  dummy 
variables  indicating  firms  and  years,  is  the  same  for  each  of  the 
forecast  sources,  there  is  no  efficiency  gain  over  equation-by- 
equation  least  squares  (see  Zellner  (1962)).   The  advantage  of 
stacking  the  equations  is  for  joint  estimation  of  the  firm  and 
year  effects,  so  that  observations  from  each  of  the  five  forecast 
sources  are  adjusted  for  the  same  ^firm  and  year  effects. 

A  further  advantage  of  stacking  the  equations  is  that 
statistical  testing  is  simplified,  since  a  set  of  linear 
constraints  on  the  estimated  slope  coefficients  8  j -^  generates 
direct  tests  of  differences  in  slope  across  forecast  sources. 
For  example,  if  6-^  is  the  vec;tor  of  five  s]ope  coefficients,  one 
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for  each  forecast  source,  and  if  C12'  is  the  vector  (1,-1,0,0,0), 

then  Ci2'Bx  estimates  the  difference  in  slopes  between  the  first 
and  second  sources,  with  standard  error: 


S  ,  o  =  s 
c  '  6     V 


- 1   1/2 
[c'(X'X)g  e]'^'' 


(3.18) 


In  (3.18),  Sy  is  the  residual  standard  error  from  the  joint 
estimation  of  equation  (3.17).   (X'X)-lg  jg  the  lower-right 
submatrix  of  five  rows  and  five  columns  from  the  (X'X)-l  matrix 
of  equation  (3.17).   This  submatrix  determines  the  variance- 
covariance  relations  among  the  five  slope  coefficients. 

Linear  constraints  of  the  form  of  C12  are  used  to  evaluate 
differences  in  the  slope  coefficients,  i.e.  differences  in  the 
association  between  excess  returns  and  forecast  errors,  across 
forecast  sources.   The  results  from  estimation  of  equation  (3.17) 
are  in  Tables  8  through  11,  and  are  discussed  in  section  5. 

4^ D§.ta_Descr  j;2ii2Q. 


The  forecast  data  are  from  the  Institutional  Brokers 
Estimate  System,  or  I/B/E/S,  a  database  developed  by  the  Lynch, 
Jones  and  Ryan  brokerage  house.   T)ie  database  consists  of 
estimates  from  between  50  and  130  brokerage  houses,  each 
employing  many  analysts.   EPS  are  forecast  for  between  1000  and 
2500  firms,  depending  on  the  month  and  year  in  question.   From 
each  brokerage  house,  at  most  one  analyst  is  reported  predicting 
EPS  for  any  given  firm.   The  data  used  in  this  paper  are 
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Individual  analysts'  forecasts,  conpiled  from  July  1975  through 
September  1982.14 

The  database  is  updated  on  or  around  the  25th  of  each  month. 
From  the  updated  list  of  forecasts,  summary  statistics  such  as 
the  mean,  the  median,  the  standard  deviation,  the  number 
increasing,  etc.,  are  computed.   The  summary  statistics  are  then 
sold  to  client:s,  primarily  institutional  Investors. 

COMPUSTAT  is  the  source  of  earnings  data  and  most  earnings 
announcement  dates.   The  remaining  announcement  dates  are  from 
the  Wall  Street  Journal  (WSJ)  and  its  Index.   Stock  return  data 
and  the  trading  day  calendar  are  from  the  CRSP  Daily  Returns 
file.   Data  on  stock  splits  and  stock  dividends  are  from  the  CRSP 
Monthly  Master  file. 

The  initial  sample  comprises  the  set  of  firms  in  the  I/B/E/S 
database  with  December  yearends ,  and  with  forecasts  available  for 
each  year  from  1975  through  1981.   This  set  contains  508  firms, 
and  3556  firm-years.   Firm-years  are  excluded  if  the  annual 
earnings  are  not  available  on  COMPUSTAT,  or  if  all  four  quarterly 
earnings  announcement  dates  are  not  available  from  COMPUSTAT  or 
the  WSJ.   This  reduces  the  sample  to  457  firms,  with  3440  firm- 
years.   The  estimation  requirements  of  the  quarterly  models,  30 
continuous  quarters  of  data  prior  to  1975-IV,  further  reduce  the 
sample  to  190  firms,  with  1284  firm-years.   The  final 
requirement,  listing  on  CRSP,  reduces  the  sample  to  184  firms 
with  1260  firm-years. 

Analysis  occasionally  produce  forecasts  for  fully-diluted. 
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rather  than  primary,  EPS.   Primary  EPS  are  earnings  divided  by 
the  average  number  of  outstanding  common  shares,  including  common 
stock  equivalents  such  as  stock  options  and  warrants.   Fully- 
diluted  EPS  are  earnings  divided  by  the  average  number  of 
outstanding  common  shares,  computed  "to  show  the  maximum 
potential  dilution  of  current  EPS"  (APB  Opinion  No.  15), 
including  in  the  denomimator  such  things  as  contingently  issuable 
common  stock.   Forecasts  of  fully-diluted  EPS  are  converted  to 
primary,  using  the  ratio  of  fully-diluted  to  primary  EPS  for  that 
firm  and  year  from  COMPUSTAT .   Forecasts  are  also  adjusted  for 
changes  in  capitalization  which  are  announced  between  the 
forecast  date  and  the  annual  earnings  announcement  date. 

Forecasts  for  each  firm  and  year  are  selected  at  five  fixed 
horizons  of  less  than  one  year  in  duration.   They  are:   240,  180, 
120,  60  and  5  trading  days  prior  to  the  announcement  of  annual 
earnings.   The  first  four  horizons  correspond  roughly  to  dates 
following  each  of  the  year's  quarterly  earnings  announcements; 
the  fifth  is  immediately  prior  to  the  annual  announcement.   For 
example,  a  horizon  of  240  trading  days  will  usually  correspond  to 
a  date  after  the  previous  year's  annual  announcement,  and  before 
the  current  year's  first  quarter  announcement.   A  horizon  of  180 
trading  days  will  typically  correspond  to  a  date  between  the 
first  quarter  announcement  and  the  second  quarter  announcement; 
and  so  on  . 

The  motivation  for  selecting  horizons  corresponding  to  dates 
in  different  fiscal  quarters  is,  first,  to  observe  whether  there 
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is  an  evolution  of  expectations  over  th«  course  of  the  year,  and 
second,  to  ensure  that  the  horizons  differ  in  a  well-defined, 
observable,  and  potentially  relevant  respect:   the  anount  of 
quarterly  earnings  data  available. 15 

Given  a  horizon  T.  firm  j,  and  year  t,  the  selected 
forecasts  are  the  most  recent  available  fron  each  brokerage  house 
Issuing  a  forecast  for  firm  j  in  year  t.   I  use  the  dates 
assigned  to  forecasts  by  analysts,  not  the  dates  of  the  I/B/E/S 
file  in  which  they  first  appear.   The  lag  between  the  analyst's 
date  for  a  forecast  and  the  date  of  its  first  appearance  on 
I/B/E/S  averages  34  trading  days,  and  has  a  standard  deviation  of 
44.5  trading  days.   Thus,  sone  recently-updated  forecasts  are 
omitted  from  each  monthly  listing  by  I/B/E/S.   Since  published 
summary  statistics  are  computed  from  these  monthly  lists,  the 
summary  statistics  also  fail  to  reflect  some  recent  updates. 

Using  analysts'  dates  instead  of  I/B/E/S  "publication"  dates 
results  in  a  realignment  of  the  data,  with  fewer  omissions  of 
recently  updated  forecasts.   Previous  studies  of  analysts' 
forecasts  have  used  the  publication  date,  not  the  analyst's 
forecast  date,  to  select  forecasts.   For  example.  Fried  and 
Givoly  (1982)  and  Givoly  and  Lakon.ishok  (1982)  select  their 
samples  based  on  the  publication  date  of  the  Standard  &  Poors' 
Earnings  Forecaster,  their  source  of  forecast  data.   Fried  & 
Givoly  go  on  to  use  analysts'  dates  within  that  sample  to 
distinguish  new  and  old  forecasts.   Brown  and  Rozeff  (1978)  and 
Brown,  Foster  and  Noreen  (1984)  use  datasets  for  which  individual 
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analysts'  forecast  dates  are  not  available.   Using  publication 
dates  instead  of  forecast  dates  probably  biases  results  against 
analysts,  by  failing  to  include  some  recent  updates  of  forecasts. 

Once  the  set  of  available  forecasts  is  selected  for  each 
firm  j,  year  t  and  horizon  T,  I  find  the  mean,  the  median,  and 
the  most  current  of  this  set  to  proxy  for  the  consensus. 

The  quarterly   models  (2.3)  and  (2.4)  are  estimated  for  each 
firm,  for  each  quarter  from  1975-1  through  1981-IV.   Parameter 
estimates  are  updated  each  quarter,  using  the  previous  thirty 
quarters'  observations.   Observations  are  adjusted  for  changes  in 
the  number  of  outstanding  shares.   Annual  forecasts  are 
constructed  from  quarterly  forecasts  by  summing  the  appropriate 
realizations  and  forecasts.   For  example,  in  quarter  3,  there 
have  been  two  realizations  of  quarterly  earnings  for  the  year, 
and  two  quarters  remain  to  be  forecast.   The  annual  forecast  from 
a  quarterly  model  for  quarter  3  is  the  sum  of  actual  earnings  for 
quarters  1  and  2,  and  forecasts  for  quarters  3  and  4. 

Because  of  a  small  number  of  untraceable  Influential 
observations  which  altered  the  regression  results,  I  imposed  an 
arbitrary  censoring  rule  on  the  data  as  well:   all  forecast 
errors  larger  than  $10.00  per  share  in  absolute  value  were 
deleted  from  the  sample.   Since  typical  values  for  EPS  numbers 
are  in  the  range  of  $1.00  to  $5.00  per  share,  errors  of 
sufficient  magnitude  to  be  deleted  are  rare,  and  suggest  a  data 
entry  or  transcription  error.   This  censoring  rule  does  not 
eliminate  all  observations  which  appear  as  outliers  on  data 
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plots,  since  the  error  transformations  described  in  equations 
(3.2)  and  (3.3)  can  have  denoninators  close  to  zero. 

Excess  returns  are  estinated  from  a  daily  market  model  in 
logarithmic  form: 


ln(l  -  R.  )  =  a.  +  8.  ln(l  +  R„  )  +  £  . 
js      J     J  Ms'     js 


(4.1) 


where  Rjg  j^  ^^^  return  to  security  j  on  day  s,  R^s  is  the  return 
on  the  CRSP  equally-weighted  market  portfoJio  of  securities  on 
day  s,  and  In  denotes  the  naturaj  logarithm  transformation . 16 

The  parameters  of  (3.14)  are  estimated  for  each  firm  in  the 
study  using  200  trading  days  of  data  at  a  time,  beginning  in  July 
1974.   Estimated  parameters  are  used  to  predict  ahead  100  trading 
days,  and  eTtcess  returns  are  the  difference  between  the 
realization  ln(l.+  Rjg)  and  the  prediction  based  on  (3.14). 
After  each  iteration  of  estimation  and  prediction,  the  estimation 
period  is  rolled  forward  by  100  trading  days,  and  new  parameters 
are  estimated. 

The  estimation  procedure  produces  a  stream  of  daily  excess 

returns,  Ejs.   The  cjs  are  cumulated  over  each  forecast  horizon, 
from  the  horizon  date  through  the  announcement  date  of  annual 
EPS,  to  form  Uj^.j   ^jjg  measure  of  ^new  information  arriving  over 
horizon  t  in  year  t  for  firm  j. 


Results 


The  fundamental  difference  between  the  most  current  analyst 
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forecast  as  a  consensus  definition  and  either  the  mean  or  the 
median,  is  that  the  former  is  constructed  using  the  forecast 
date,  while  the  latter  two  are  not.   In  Table  1,  the 
distributions  of  forecast  ages  are  compared  for  the  three 
alternative  definitions  of  analyst  consensus.   The  age  of  a 
forecast  is  defined  to  be  the  difference,  measured  in  trading 
days,  between  the  forecast  date  and  the  horizon  date  chosen  for 
this  study.   More  generally,  it  can  be  considered  the  lag  between 
the  forecast  date  and  an  event  date  of  interest  to  the 
researcher  . 

For  each  firm,  year,  and  horizon,  the  consensus  forecasts 
are  computed  from  the  set  of  available  analysts'  forecasts.   The 
set  of  available  forecasts  contains  the  last  forecast  from  each 
analyst,  produced  prior  to  the  horizon  date.   In  Table  1,  the 
ages  of  the  mean  and  median  forecasts  are  defined  to  be, 
respectively,  the  mean  and  the  median  of  the  ages  of  the 
forecasts  in  the  set  of  available  forecasts  for  each  firm,  year 
and  horizon.   The  distribution  described  in  Table  1  is  over  all 
firms  and  years,  for  each  horizon. 

As  expected  from  its  definition,  the  most  current  forecast 
has  a  distribution  of  ages  much  closer  to  zero  than  either  the 
■ean  or  median.   For  the  four  longer  horizons  (240,  180,  120,  and 
60  trading  days),  over  fifty  percent  of  the  most  current 
forecasts  are  less  than  five  trading  days  old.   By  contrast,  over 
fifty  percent  of  the  mean  or  median  ages  at  all  horizons  are  more 
than  forty  trading  days.   While  some  of  these  older  forecasts  may 
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represent  circumstances  where  little  new  information  has  arrived. 
so  there  was  no  need  to  update,  the  accuracy  results  which  follow 
suggest  that  this  is  not  always  the  case. 

In  Table  2.  the  bias  results  are  presented.   The  reported 
numbers  are  the  forecast  bias,  computed  as  in  equation  (3.7),  and 
a  t-statistic  to  test  the  null  hypothesis  of  no  bias.   Results 
are  given  for  three  different  definitions  of  the  forecast  error, 
and  the  results  vary  considerably  across  definitions. 

Generally,  the  unsealed  forecast  error  (equation  (3.1))  and 
the  forecast  error  scaled  by  average  absolute  EPS  changes 
(equation  (3.2)),  in  Panels  A  and  B,  exhibit  statistically 
significant  negative  bias.   The  forecast  error  measured  as  a 
percent  of  prior  EPS,  in  Panel  C,  exhibits  mixed  results,  mostly 
positive  or  zero.   Of  the  three  analyst  consensus  measures,  the 
median  uniformly  exhibits  the  smallest  bias,  usually 
indistinguishable  from  zero. 

Negative  bias  corresponds  to  overestimates  of  EPS.   Negative 
bias  in  analysts'  forecasts  is  consistent  with  some  conventional 
wisdom,  which  says  that  analysts  prefer  to  make  optimistic 
predictions  and  "buy"  recommendations,  to  maintain  good  relations 
with  management  .  17   The  evidence  i,s  weak,  however,  in  two 
respects.   First,  the  negative  bias  result  is  very  sensitive  to 
the  definition  of  forecast  errors.   The  median  analyst  forecast 
appears  to  be  unbiased  under  all  transformations  of  the  forecast 
error.   The  percent  transformation,  reported  in  Panel  C,  gives 
mixed  results  which  would  not  lead  to  a  general  conclusion  of 
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negative  bias.   Second,  in  Panels  A  and  B,  when  the  estimates 
from  analysts  are  significantly  negative,  they  are  statistically 
indistinguishable  from  those  from  mechanical  time-series  models. 
The  motive  of  maintaining  good  relations  with  management  cannot 
be  ascribed  to  these  models.   Support  for  the  contention  that 
analysts  preferentially  issue  optimistic  forecasts  is  at  best 
weak  . 

An  alternative  explanation  which  is  also  consistent  with 
these  results  is  that  analysts  issue  unbiased  forecasts,  but  this 
seven-year  period,  1975  through  1981,  is  one  with  primarily 
negative  unanticipated  EPS.   Unfortunately,  the  way  to 
distinguish  between  the  hypothesis  of  deliberate  optimistic  bias 
and  this  alternative  is  to  collect  data  for  a  longer  span  of 
years . 

Tables  3  through  7  summarize  the  results  on  forecast 
accuracy.   The  average  absolute  error,  computed  as  described  in 
equation  (3.12)  for  each  forecast  source,  appears  in  Table  3. 
Average  squared  errors,  from  equation  (3.13),  are  in  Table  4. 
Table  5  contains  t-statistics  testing  pairwise  differences  in 
accuracy  among  analysts.   Tables  6  and  7  contain  t-statistics 
testing  pairwise  differences  in  ac.curacy  between  analysts  and  the 
quarterly  models. 

Both  Table  3  and  Table  4  display  a  consistent  pattern  of 
increasing  forecast  accuracy  as  the  earnings  announcement  date 
approaches,  for  all  forecast  sources.   That  is,  average  absolute 
and  squared  error  decline  uniformly  as  the  year  progresses,  for 
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analysts  and  for  quarterly  models.   This  is  consistent  with 
forecasts  incorporating  sone  new  infornation  relevant  to  the 
prediction  of  EPS  over  the  course  of  the  year. 

In  Tables  3  and  4  it  appears  that  the  most  current  analyst 
is  no  worse  "than  the  other  sources,  and  that  analysts  dominate 
quarterly  models  in  the  longer  liorizons.   The  reader  of  Tables  3 
and  4  should  avoid  using  the  heuristic  test  of  counting  the 
number  of  times  that  the  most  current  forecaster  is  most  accurate 
to  assess  these  differences.   The  relative  performance  results 
are  highly  correlated,  both  across  horizons  and  across 
definitions  of  the  forecast  error.   The  results  of  a  statistical 
test  for  the  differences  in  accuracy  which  are  suggested  by  a 
perusal  of  Tables  3  and  4  appear  in  Tables  5  through  7. 

In  Tables  5  through  7,  a  positive  difference  means  that  the 
first  of  the  pair  is  less  accurate.   Table  5  contains  the  results 
of  pairwise  comparisons  among  the  three  analyst  consensus 
definitions.   In  terms  of  absolute  error,  which  is  reported  in 
Panel  A,  when  the  differences  are  significant  they  favor  the  mean 
over  the  median  and  the  most  current  forecaster  over  either  the 
mean  or  the  median.   In  Panel  B,  where  differences  in  squared 
error  accuracy  are  presented,  there  are  no  measurable  differences 
in  accuracy  among  the  three  analyst  consensus  definitions. 

The  results  reported  in  Tables  6  and  7  indicate  that  analyst 
forecasts  generally  dominate  the  time-series  models  at  the  longer 
horizons.   For  the  240,  180  and  1  20- 1 rad i ng-day  horizons, 
wherever  differences  are  statistically  significant,  the  results 
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favor  analysts  over  the  quarterly  models.   This  evidence  is 
consistent  with  analysts  using  a  broader  information  set  than  can 
be  exploited  by  a  univariate  model. 

At  the  60-trading-day  horizon,  however,  the  quarterly  time- 
series  models  dominate  the  mean  and  the  median  analyst  forecast 
In  all  comparisons  where  significant  differences  exist.   The  most 
current  forecast  is  never  dominated  to  a  statistically 
significant  extent  by  the  quarterly  models,  but  generally  is 
indistinguishable  from  them. 

The  accuracy  results  reported  in  Tables  3  through  7  are 
mixed.   Often,  differences  in  accuracy  between  forecast  sources 
are  statistically  insignificant.   Where  they  are  statistically 
significant,  however,  the  results  generally  conform  with 
expectations.   The  most  current  forecaster  is  at  least  as  good 
as,  and  by  some  measurements  dominates,  the  mean  or  median.   This 
is  consistent  with  the  most  current  forecast  impounding 
information  from  previously  released  forecasts.   The  results 
reported  in  here  probably  understate  the  difference  between  the 
most  current  forecast  and  the  mean  and  median  definitions  which 
appear  in  most  other  published  work.   The  sample  used  here  was 
selected  to  eliminate  the  " public a^t ion  lag"  which  is 
characteristic  of  datasets  which  use  publication  dates,  rather 
than  analysts'  dates,  to  select  forecasts. 

Analysts  are  generally  at  least  as  good  as,  and  in  longer 
time  horizons  dominate,  univariate  time-series  models.   This  is 
consistent  with  analysts  incorporating  a  broader  information  set 
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into  tfaeir  forecasts  than  simply  the  univariate  time-series. 
Again,  the  results  here  may  understate  differences.   Since  the 
sample  of  firms  is  reduced  sharply  by  the  data  requirements  of 
the  time-series  models,  the  selection  process  may  exclude  firms 
where  analysts'  information  advantage  is  largest. 

Tables  8  through  11  contain  the  results  from  estimation  of 
model  (3.17),  to  test  for  positive  association  between  forecast 
errors  and  excess  returns.   The  method  used  is  to  regress  the 
forecast  error,  or  one  of  the  trjinsf ormat ions  of  the  forecast 
error,  on  the  cumulative  excess  returns  over  the  forecast 
horizon,  and  to  test  for  a  significant  positive  slope. 18   Table  8 
contains  adjusted  R2  and  the  F-statistic  on  the  full  model,  for 
each  forecast  horizon,  along  with  information  on  sample  sizes  and 
number  of  estimated  parameters.   Table  9  contains  incremental  F- 
tests  for  parts  of  the  model  separately.   In  Table  10,  the 
estimated  slope  coefficients  and  their  associated  t-statistics 
are  reported,  and  in  Table  11  the  tests  for  differences  In  the 
estimated  slope  coefficients  across  forecast  sources  are 
reported . 

According  to  results  reported  in  Table  8,  equation  (3.17) 
explains  between  9*  and  16%  of  the,  variation  in  forecast  errors, 
with  slight  variations  across  horizons  and  across  definitions  of 
the  dependent  variable.   The  largest  adjusted  R2  appears  at  the 
5- trad ing-day  horizon,  though  differences  in  explanatory  power  of 
model  (3.17)  across  horizons  are  slight.   The  weakest  results 
occur  when  the  percent  transformation  of  forecast  errors  is  the 
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dependent  variable,  but  again  differences  are  not  large  across 
transformations  of  the  error.   All  variants  of  the  model  have 
statistically  significant  explanatory  power,  according  to  the 
regression  F-stat ist ics . 

The  incremental  F-statistics  in  Table  9  confirm  that  the 
year-specific  effects  are  important  in  equation  (3.17).   The  F- 
statistic  on  the  year-specific  effects  tests  the  null  hypothesis 


"O  =  "2ilT  =  "2i2T 


a 


21TT 


0 


(5.1) 


That  is,  the  F-statistic  tests  the  null  hypothesis  that 
estimation  of  year-specific  intercepts  adds  no  explanatory  power 
to  the  model.   This  hypothesis  is  rejected  at  the  .05  level  at 
all  horizons,  and  at  the  .001  level  or  better  at  the  horizons 
longer  than  5  days.   The  importance  of  year  effects  in  the  model 
increases  with  the  length  of  the  horizon.   This  is  consistent 
with  the  information-based  explanation  for  their  inclusion  in  the 
model,  namely  that  forecast  errors  impound  time-period-specific 
unanticipated  information.   Over  longer  horizons,  loosely 
speaking,  the  "quantity"  of  unanticipated  information  is  greater. 
The  strength  of  this  result  also  confirms  the  assertions  made 
earlier  that  a  c ros s -sec t i on  of  forecast  errors  for  a  single  time 
period  is  not  a  set  of  independent  observations. 

The  F-statistics  on  f i rm- spec i f i c  effects  reported  in  Table 
9  also  reject  the  null  hypothesis,  which  is: 
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There  are  measureable  firm-specific  differences  in  average 
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forecast  error  at  all  horizons,  even  after  the  adjustnent  for 
firm-specific  information  inpounded  in  excess  returns.   The 
strength  of  this  result  varies  little  across  forecast  horizons 
and  across  transformations  of  the  dependent  variable. 

The  importance  of  the  slope  coefficients  in  model  (3.17), 
indicated  by  the  F-statisic  reported  in  Table  9,  varies  across 
forecast  horizons  and  across  transformations  of  the  dependent 
variable.   The  individual  slope  coefficients  reported  in  Table 
10,  however,  are  of  greater  relevance.   Generally,  the  results  in 
Table  10  show  a  pattern  of  positive  association  between  forecast 
errors  and  excess  returns.   A  positive  association  is  expected 
if:  (1)  there  is  some  overlap  between  Information  relevant  to 
firm  value  and  information  relevant  to  current-year  earnings,  and 
(2)  some  of  this  overlapping  information  is  unanticipated  both  by 
investors  and  by  the  predictor  of  EPS. 

The  statistical  significance  of  the  positive  association 
varies  with  the  definition  of  the  dependent  variable,  and  with 
the  forecast  horizon.   The  strongest  results  are  found  for  the 
unsealed  forecast  error,  and  for  the  120-day  horizon.   Among  the 
analyst  consensus  forecasts,  the  strongest  results  are  generally 
for  the  most  current  forecaster,  w^hich  is  consistent  with  the 
most  current  forecaster  acting  as  a  reasonable  composite  of 
analysts'  information.   The  strongest  results,  and  the  only  ones 
which  are  consistently  positive  and  statistically  significant, 
however,  are  for  the  quarterly  au toregress i ve  model,  equation 
(2.3).   This  pattern  of  relative  performance  does  not  vary 
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substantially  across  horizons  or  across  transformations  of  the 
dependent  variable.   This  result  is  anomalous,  especially  in 
light  of  the  quarterly  model's  relative  inaccuracy.   It  indicates 
that  forecast  errors  from  a  quarterly  autoregressive  model  are 
more  highly  correlated  with  excess  returns  than  are  forecast 
errors  from  analysts  forecasts. 

Table  11  contains  the  results  of  statistical  tests  for  the 
differences  in  association  observed  in  Table  10.   Results  are 
shown  for  tests  of  pairwise  differences  between  the  quarterly 
autoregressive  model  and  all  other  forecast  sources,  and  for 
differences  between  the  most  current  analyst  and  the  mean  and 
median  analyst  forecasts.   The  statistical  tests  confirm  the 
anomalous  result,  that  errors  from  a  mechanical  quarterly  model 
often  are  significantly  more  closely  related  to  excess  returns 
than  errors  from  analysts.   In  addition,  the  tests  indicate  that 
while  the  most  current  forecast  typically  shows  the  strongest 
result  among  analyst  consensus  forecasts,  the  difference  is  not 
statistically  significant,  in  general. 


6^ SummarY_and_Concl usions 


Analysts'  predictions  of  EPS  are  a  potential  source  of 
"market  expectations"  information.   I  have  examined  properties  of 
different  composites,  or  consensus  forecasts.   Results  reported 
here  indicate  that  the  most  current  forecast  from  a  set  of 
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analysts'  forecasts  is  a  reasonable  aggregation  of  the 
information  in  the  set. 

Five  alternate  sources  of  forecasts  are  examined:   the  mean, 
median,  and  most  current:  from  a  set  of  analysts'  forecasts,  an 
autoregressive  model  in  fourth  differences  of  the  univariate 
series  of  quarterly  EPS.  and  a  fourth-differenced  random  walk 
using  quarterly  EPS.   The  two  quarterly  time-series  models  are 
included  primarily  as   benchmarks. 

The  most  current  forecast  is  at  least  as  accurate  as  either 
the  mean  or  median  forecast,  and  generally  dominates  them  in 
absolute  error  terms.   This  result  indicates  that  the  date  of  the 
forecast  is  relevant  for  determining  its  accuracy,  and  dominates 
"diversification"  obtained  by  aggregating  forecasts  from 
different  sources.   Since  most  published  aggregations  of 
forecasts  and  most  previous  research  treat  all  forecasts  as  if 
they  are  equally  current,  they  ignore  this  relevant  piece  of 
information.   In  this  sample  the  forecast  error  from  the  most 
current  forecast  is  more  closely  associated  with  excess  returns 
over  the  forecast  horizon  than  the  error  from  the  mean  or  the 
median  forecast,  but  the  difference  In  association  is  not,  in 
general,  statistically  significant. 

Analysts  generally  are  significantly  more  accurate  than 
time-series  models.   Errors  from  the  quarterly  autoregressive 
model,  however,  appear  to  be  more  closely  related  with  excess 
returns  over  the  forecasting  horizon  than  those  of  analysts, 
however.   Because  of  this  anomalous  result,  it  is  unclear  that 
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analysts  provide  a  better  model  of  the  "market  expectation"  than 
mechanical  models. 

It  should  be  noted,  though,  that  the  sample  of  firms  was 
reduced  sharply  by  the  data  requirements  of  the  time-series 
models.   This  sample,  with  its  selection  bias  toward  longer-lived 
firms  with  continuous  data  available,  does  not  clearly  isolate 
cases  where  analysts  might  be  expected  to  have  most  advantage 
over  mechanical  models,  and  perhaps  eliminated  many  of  these 
cases.   These  firms,  where  there  is  a  substantial  amount  of  non- 
earnings  information  expected  to  have  an  impact  on  earnings,  may 
be  a  fruitful  area  for  future  investigation. 
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Table  1 

Selected  Characteristics 
of  the  Distribution  of  Forecast  Ages,l 
for  the  Mean,  Median  and  Most  Current  Analyst  Forecasts, 2 

for  Five  Forecast  Horlzons3 


Horizon 

__Ira 

cti^es 

_Sam 

Source 

.  1 

.25 

.5 

.  75 

.  9 

mean 

mean 

240 

34 

46 

60 

74 

89 

61  .  5 

180 

33 

43 

58 

7  3~ 

91 

59.9 

120 

34 

47 

62 

82 

102 

65.9 

60 

38 

51 

67 

88 

110 

71  .7 

5 

40 

52 

67 

87 

109 

72  .3 

median 

240 

7 

20 

51 

83 

127 

59.4 

180 

5 

16 

41 

78 

131 

56  .  6 

120 

7 

19 

47 

80 

135 

61  .1 

60 

7 

19 

49 

88 

155 

66.6 

5 

15 

26 

43 

71 

131 

61  .7 

current 

240 

1 

2 

4 

14 

43 

14.4 

180 

1 

1 

3 

9 

24 

9.0 

120 

1 

2 

4 

11 

25 

10.0 

60 

1 

1 

4 

10 

26 

10  .  6 

5 

1 

5 

11 

20 

34 

15.  8 

Sa52l«_Moments 

stdev  skew    N 


24.7  0.8  1198 
23.9  0.8  1235 

28.1  1.0  1254 

31.8  1.6  1258 

31.2  2.5  1260 

47.9  1.1  1198 
54.5  1.5  1234 
58.0  1.8  1254 

67.7  2.0  1258 

60.8  2.6  1260 

24.7  3.3  1198 

16.9  4.8  1233 

18.8  6.3  1254 
22.5  6.8  1259 
22.0  8.9  1259 


Notes 


The  age  of  a  forecast  is  the  number  of  trading  days  between  the 
analyst's  forecast  date  and  the  horizon  date  chosen  in  this 
study.   For  the  consensus  sources,  age  is  defined: 

mean  -  the  mean  of  the  ages  of  available  analyst  forecasts 
median  -  the  median  of  the  ages^  of  available  analyst 
forecasts  . 
current  -  the  age  of  the  most  recent  analyst's  forecast. 
'The  forecast  sources  are: 

mean  -  the  mean  of  available  analysts'  forecasts, 
median  -  the  median  of  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst. 
^The  forecast  horizons  are  measured  in  trading  days  prior  to  the 
annual  earnings  announcement. 
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Table  2 


Forecast  Bias ,  1 

for  Five  Forecast  Sources, 2  at  Five  Forecast  HorizonsS 

( t-s ta t i s t i cs4  are  in  parentheses) 


Panel  A : 

Unsealed  error 
[Equation 
(3.1)] 


Panel  B: 

Scaled  error 
[Equation 
(3.2)] 


Horizon 
Source  : 
q  .  a  .  r  . 


q  .  r  .  w 


mean 


median 


current 


q  .  a  .  r  . 


q  .  r  .  w 


mean 


median 


current 


240 

•0  .  15 

3.82) 

■0  .  09 
-2.17) 

-0  .  08 
-2.09) 

0  .  03 
-0.72) 

-0  .  08 
-1.91) 


■0  .  24 
-3.69) 

-0  .  14 
-2.24) 

-0.17 
-2.58) 

0  .  09 
-1.38) 

-0.18 
-2.84) 


180 

-0.11 

(-3.15) 

-0  .  05 
(-1.49) 

-0.12 
(-3.57) 

-0  .  05 
(-1 .52) 

-0  .  10 
(-2.93) 


-0.12 
(-2.41) 

-0  .  03 
(-0 . 66) 

-0  .  20 
(-4.05) 

-0  .  09 
(-1.89) 

-,0.17 
(-3.48) 


120 

-0  .  14 

(-5. 18) 

-0.11 
(-4.24) 

-0.12 
(-4.70) 

-0  .  06 
(-2.25) 

-0  .  09 
(-3.27) 


-0  .  20 
(-4.70) 

-0.15 
(-3.57) 

-0.18 
(-4.20) 

-0  .  08 
(-1 .83) 

-0.13 
(-3.01) 


60 

-0  .  08 
4.11) 

-0  .  06 
-3.  25) 

-0  .  09 
-4  .  96) 

-0  .  02 
-1.19) 

-0  .  04 
-2.34) 


■0.11 
■3.59) 

0  .  08 
■2.56) 

0  .  14 
■4  .35) 

0  .  03 
■0  .  94  ) 

■0  .  08 
■2  .  60) 


-0  .  05 

(-2.74) 

0  .  02 
(1 .00) 

-0.  02 
(-1.42) 


-0  .  07 
( -2 . 08) 

0  .  03 
(1  .00) 

-0  .  04 
(-1.36) 
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Table  2  (Continued) 


Horizon: 

240 

180 

120 

60 

5 

Source ; 

anel  C: 

q . a . r. 

0.05 

0.05 

.00 

-0.01 

%   error 

(1.90) 

(2.14) 

(-0.31) 

(-0.69) 

[Equation 

(3.3)] 

q. r . w. 

0.06 

0.07 

0.03 

0  .01 

(2.24) 

(3.25) 

(1.78) 

(1.54) 

mean 

-0.01 

-0.02 

-0.02 

-0.02 

-0.02 

(-0.52) 

(-0.60) 

(-1.61) 

(-2.72) 

(-2.55) 

median 

.00 

.  00 

-0.01 

-0.01 

.00 

(0  .07) 

(0.05) 

(-0.37) 

(-0.70) 

(0.21) 

current 

-0.01 

-0.01 

-0.  01 

-0.01 

-0.  01 

(-0.34) 

(-0.47) 

(-0.94) 

(-1.31) 

(-1.11) 

Notes: 


^  T  h  e 
^The 


^The 
^The 


cniTtTiutat  i  on 

descr i  bed 
forecast  so 


q 
q 


r  .  w . 


mean 
median 
current 
forecast  ho 
annual  earn 
degrees  of 
1 , 000  ,  so  t 
the  . 05  and 
1 . 96  and  2 . 


of  forecast  bias  and  its  associated  t-statistic  are 
in  equations  (3.6)  through  (3.9)  in  the  text, 
urces  are: 

quarterly  autoregress i ve  model  in  fourth  differences; 
quation  (2.3)  in  the  text. 

random  waJk  model  with  drift  in  fourth  differences; 
quation  (2.4)  in  the  text. 

he  mean  of  the  available  analysts'  forecasts, 
he  median  of  the  available  analysts'  forecasts, 
he  most  recent  forecast  from  an  analyst, 
rizons  are  measured  in  trading  days  prior  to  the 
ings  announcement. 

freedom  for  all  reported  t-statistics  are  over 
hey  are  approximately  normal.   For  a  two-sided  test, 

.01  critical  points  of  the  N(0,1)  distribution  are 
58,  respectively. 
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Table  3 


Forecast  Accuracy: 

Average  Absolute  Forecast  Errorl 

for  Five  Forecast  Sources, 2  and  Five  Forecast  Horizons3 


Dependent 

Hor i  zon : 

240 

] 

L80 

120 

60 

5 

Variable: 

Source  : 

q  .  a  .  r  . 

0 

.  975 

0  . 

.  780 

0, 

.592 

0  . 

350 

Unsca  led 

q  .  r  .  w  . 

0 

.  963 

0 

.  781 

0 

.620 

0  , 

.  363 

error 

mean 

0 

.  747 

0 

.  645 

0 

.  516 

0. 

395 

0. 

291 

[  Equation 

median 

0 

.788 

0 

.677 

0 

.546 

0 

.435 

0 

.  321 

(3.1)] 

current 

0 

.  742 

0 

.610 

0 

.468 

0  , 

.342 

0. 

.292 

q  .  a  .  r  . 

.  478 

1 

.  112 

0 

.  867 

0 

.505 

Scaled 

q  .  r  .  w  . 

.  494 

1 

.  121 

0 

.  919 

0. 

.523 

error 

mean 

.  147 

0 

.  990 

0 

.777 

0 

.616 

0 

.467 

[ Equation 

median 

.  208 

1 

.  035 

0 

.833 

0  , 

,677 

0  . 

.  516 

(3.2)] 

current 

.  181 

0 

.958 

0 

.733 

0 

.  555 

0 

.492 

q  .  a  .  r  . 

0 

.337 

0 

.  273 

0 

.  191 

0  , 

,  115 

%    error 

q  .  r  .  w  . 

0 

.  335 

0 

.273 

0 

.212 

0 

.  124 

[Equation 

mean 

0 

.  244 

0 

.  212 

0 

.  164 

0  . 

,  121 

0  , 

.090 

(3.3)] 

median 

0 

.  255 

0 

.  219 

0 

.  173 

0 

.  132 

0 

.096 

current 

0 

.242 

0 

.  202 

0 

.  154 

0 

.  104 

0, 

.088 

Notes 

iThe 

^The 


'The 


computation  of  average  absolute  error  is  described  in  equations 
(3.10)  and  (3.12)  in  the  text, 
forecast  sources  are: 


q.a.r.  -  a  quarterly  autoregress i ve  model  in  fourth  differences; 

equation  (2.3)  in  the  text, 
q.r.w.  -  a  random  walk  model  with  drift  in  fourth  differences; 
equation  (2.4)  in  the  te,xt. 
mean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst, 
forecast  horizons  are  measured  in  trading  days  prior  to  the 
annual  earnings  announcement. 
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Table  4 


Forecast  Accuracy: 
Average  Squared  Forecast  Errorl 
for  Five  Forecast  Sources, 2  and  Five  Forecast  HorizonsS 


Dependent 

Horizon: 

240 

180 

120 

60 

5 

Variable: 

Source: 

q  .  a  .  r  . 

2.771 

1.877 

1  .203 

0. 

.460 

Unsealed 

q  .  r  .  w  . 

2  .626 

1.897 

1  .255 

0. 

.447 

error 

nean 

1  .531 

1.239 

0.818 

0  . 

.478 

0  . 

351 

[Equation 

nedian 

1  .595 

1  .246 

0.790 

0 

.546 

0 

.409 

(3.1)] 

current 

1  .638 

1  .  178 

0.  669 

0. 

.389 

0  . 

.361 

q  .  a  .  r  . 

6.702 

3  .042 

2.719 

0 

.788 

Scaled 

q  .  r . w. 

6.482 

2  .  950 

2.724 

0. 

.793 

error 

nean 

3.450 

2  .  810 

2.066 

1 

.458 

1 

.083 

[Equation 

median 

3.615 

2  .857 

2  .035 

1 

.  529 

1 

.146 

(3.2)] 

current 

4.766 

3.245 

1  .896 

1 

.466 

1 

.257 

q  .  a  .  r  . 

1  .694 

0  .914 

0  .247 

0 

.  103 

%    error 

q . r . w. 

1.479 

1.107 

0.481 

0 

.  144 

[Equation 

mean 

0.456 

0.  314 

0.  163 

0 

.066 

0 

.  052 

(3.3)] 

med  ian 

0  .  455 

0.326 

0  .  146 

0 

.073 

0 

.  052 

current 

0.468 

0.  316 

0.  206 

0 

.045 

0 

.  037 

Notes : 


^The 
2The 


>The 


computation  of  average  absolute  error  is  described  in  equations 
(3.11)  and  (3.13)  in  the  text, 
forecast  sources  are: 
q.a.r.  -  a  quarterly  autoregressive  model  in  fourth  differences 

equation  (2.3)  in  the  text, 
q.r.w.  -  a  random  walk  model  with  drift  in  fourth  differences; 
equation  (2.4)  in  the  text, 
mean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst, 
forecast  horizons  are  measured  in  trading  days  prior  to  the 
annual  earnings  announcement. 
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Table  5 


Pairwise  Differences  in  Forecast  Accuracyl 

Among  the  Mean,  Median  and  Most  Current  Analyst  Forecasts, 2 

for  Five  Forecast  Horlzons3 


Panel  A:   t-S ta t i s t i cs4  on  Differences  in  Average  Absolute  Error 

Horizon:      240        180        120        60 


Dependent 
Variable : 5 

(3.1) 


(3.2) 


(3.3) 


mean  -  med  i  an 

mean  -  current 

median  -  current 

mean  -  median 

mean  -  current 

median  -  current 

mean  -  med  i  an 

mean  -  current 

median  -  current 


1 . 

.06 

-0  . 

,  98 

-1  . 

,  11 

-2  , 

,  14 

-1  . 

,62 

0 

.  11 

1 

,  07 

1  , 

.  82 

2 

.  83 

-0 

.02 

1 , 

.  18 

2. 

,05 

2  , 

,  93 

4  , 

.  97 

1  , 

.60 

0 

.83 

-0 

.  85 

-1 

.  12 

-1 

.70 

-1 

.  29 

0  , 

.  47 

0  , 

.60 

0 

.89 

1 

.68 

-0 

.  65 

0 

.36 

1 

.  44 

2 

.01 

3 

.  38 

0 

.  64 

0 

.33 

-0, 

.  28 

-0 

.50 

-1 

.  12 

-0 

.82 

0 

.07 

0 

.  35 

0 

.58 

1 

.  77 

0 

.29 

0 

.40 

0 

,64 

1 

.08 

2 

.  89 

1 

.  12 

Panel  B:   t-S tat i s t ics4  on  Differences  in  Average  Squared  Error 

Dependent  Horizon:     240       180       120       60 

Variable:5 

(3.1) 


(3.2) 


(3.3) 


mean  --    median 

-0  . 

,  26 

-0  , 

.  04 

0  , 

.20 

-0 

,  83 

-0  , 

,59 

mean  -  current 

-0 

.  44 

0 

.30 

1 

.06 

1 

.  09 

-0 

.  10 

median  -  current 

-0, 

,  18 

0 

.  34 

0  , 

.86 

1  , 

.91 

0  , 

,50 

mean  -  median 

-0 

.  15 

-0 

.  07 

0 

.  05 

-0 

.  15 

-0 

.  12 

mean  -  current 

-1  , 

.  22 

-0  , 

.  64 

0  , 

,  26 

-0. 

.  02 

-0  , 

,33 

median  -  current 

-1 

.06 

-0 

.57 

0 

.21 

0 

.  13 

-0 

,  21 

mean  -  median 

.  00 

-0 

.02 

0  , 

,  09 

-0  , 

,  16 

0  , 

02 

mean  -  current 

-0 

.  01 

.  00 

-0 

.  25 

0 

.46 

0, 

,76 

median  -  current 

-0  , 

,  01 

0 

.  02 

-0  , 

,  34 

0  , 

,61 

0  . 

74 

Notes  : 

^The  reported  numbers  are  t-statistics  on  pairwise  differences  in 
average  absolute  or  squared  forecast  error.   See  Table  3  for 
average  absolute  errors,  and  Table  4  for  the  average  squared 
errors.   The  computations  are  described  in  equations  (3.10) 
through  (3.15)  in  the  text. 
^The  forecast  sources  are: 

mean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts. 


the 
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^The 


current  -  the  most  recent  forecast  from  an  analyst, 
forecast  horizons  are  measured  in  trading  days  prior  to  the 
annual  earnings  announcement. 

degrees  of  freedom  for  ail  reported  t-statistics  are  over 
2,000,  so  they  are  approximately  normal.   For  a  two-sided  test, 
the  .05  and  .01  critical  points  from  the  N(0,1)  distribution  are 
1.96  and  2.58,  respectively. 
^The  dependent  variables,  denoted  (3.1),  (3.2)  and  (3.3),  refer  to 
equation  numbers  in  the  text,  and  are  alternat*  definitions  of 
the  forecast  error. 
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Table  6 


Pairwise  Differences  in  Forecast  Accuracyl 
Between  Analysts  and  a  Quarterly  Autoregress i ve  Model, 2 
for  Five  Forecast  HorlzonsS 


Panel  A:   t-Sta t i s t lcs4  on  Differences  in  Average  Absolute  Error 


Dependent 
Variable:5 

(3.1) 


Quarter : 
Horizon : 


q.a.r.  -  mean 

q.a.r.  -  median 

q.a.r.  -  current 


1 
240 

5  .  85 

4  .  79 

5  .  96 


2 

180 

4  .  15 
3  .  17 

5  .  22 


3 
120 

2  .  85 
1  .74 
4  .  67 


4 

60 

2  .  36 

■4  .  51 

0  .  46 


q.a.r.  -mean      4.54 

(3.2)     q.a.r.  -  median      3.71 

q.a.r.  -  current      4.07 


28 
43 
88 


82 
70 
71 


-3  .  06 
-4  .  76 
-1  .  38 


q.a.r.  -  mean 

(3.3)     q.a.r.  -  median 

q.a.r.  -  current 


74 
41 
81 


34 
06 
69 


55 
05 

13 


-0  .  67 

-1  .  79 

1.11 


Panel  B 


t-Stat istics4 


on  Differences  in  Average  Squared  Error 


Quarter : 

1 

2 

3 

4 

Dependent 

Horizon : 

240 

180 

120 

60 

Variable : 5 

q  .  a  . 

r .  -  mean 

5  .  08 

3  .  16 

2.73 

-0, 

.22 

(3.1) 

q.a.r. 

-  med  ian 

4.81 

3.13 

2.93 

-1 

.04 

q.a.r. 

-  current 

4  .  64 

3.47 

3  .  79 

0 

,87 

q  .a  . 

. r .  -  mean 

3  .  00 

0.34 

0.99 

-1 

.  38 

(3.2) 

q.a.r. 

-  median 

2.85 

0.27 

1  .  04 

-1  , 

.  53 

q.a.r. 

-  current 

1  .  79 

-0  .  30 

1  .  25 

-1 

.40 

q  .a  , 

. r .  -  mean 

1  .30 

1.11 

0  .48 

0 

.  81 

(3.3) 

q.a.r 

-  med  ian 

1  .  30 

1  .  09 

0  .  57 

0 

.65 

q.a.r. 

-  current 

1  .  29 

1  .  10 

0  .  23 

1 , 

,  26 

Notes  : 


The  reported  numbers  are  t-statistics  on  pairwise  differences  in 
average  absolute  or  squared  forecast  error.   See  Table  3  for 
average  absolute  errors,  and  Table  4  for  the  average  squared 
errors.   The  computations  are  described  in  equations  (3.10) 
through  (3.15)  in  the  text. 


the 
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^The  forecast  sources  are: 

q.a.r.  -  a  quarterly  autoregress i ve  model  in  fourth  differences; 
equation  (2.3)  in  the  text, 
■ean  -  the  nean  of  the  available  analysts'  forecasts. 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst. 
^The  forecast  horizons  are  measured  in  trading  days  prior  to  the 

annual  earnings  announcement. 
^The  degrees  of  freedom  for  all  reported  t-statistics  are  over 

2,000,  so  they  are  approximately  normal.   For  a  two-sided  test, 
the  .05  and  .01  critical  points  from  the  N(Q,1)  distribution  are 
1.96  and  2.58,  respectively. 
^The  dependent  variables,  denoted  (3.1),  (3.2)  and  (3.3),  refer  to 
equation  numbers  in  the  text,  and  are  alternate  definitions  of 
the  forecast  error. 
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Table  7 


Pairwise  Differences  in  Forecast  Accuracyl 
Between  Analysts  and  a  Quarterly  Random  Walk  Model, 2 
for  Four  Forecast  Horizons3 

Panel  A:   t-Stat i s t ics4  on  Differences  in  Average  Absolute  Error 


Dependent 
Variable:5 

(3.1) 


Quarter 
Horizon 


q . r . w .  -  mean 

q . r . w .  -  median 

q.r.w.  -  current 


1 
240 

5  .  56 

4  .  50 

5  .  67 


2 

180 

4  .  20 
3.  21 

5  .  27 


3 

120 

3  .  92 
2  .  81 
5  .  74 


4 

60 

-1  .  70 

-3  .84 

1  .  13 


q.r.w.  -  mean 

(3.2)     q.r.w.  -  median 

q.r.w.  -  current 


4  .  75 

3  .  92 

4  .  29 


45 
61 
05 


2  .  86 
1  .  74 

3  .  75 


-2  .  57 
-4  .  27 
-0  .  89 


q.r.w.  -mean      2.68 

(3.3)     q.r.w.  -  median      2.35 

q.r.w.  -  current      2.75 


33 
04 
68 


80 
29 
37 


0  .  32 

-0  .  80 

2  .  10 


Panel  B 


t-Statistics  on  Differences  in  Average  Squared  Error 


Dependent 
Variable  :  5 

(3.1) 


(3.2) 


(3.3) 


Quarter 


q.r.w.  -  mean 

q.r.w.  -  median 

q.r.w.  -  current 

q.r.w.  -  mean 

q.r.w.  -  median 

q.r.w.  -  current 

q.r.w.  -  mean 

q.r.w.  -  median 

q.r.w.  -  current 


1 

2 

3 

4 

240 

180 

120 

60 

4  .49 

3.26 

3.09 

-0 

.  38 

4  .  22 

3.23 

3.29 

-1 

.  21 

4  .  05 

3  .  56 

4.15 

0 

.70 

2.80 

0.21 

1  .00 

-1 

.37 

2  .  65 

0.  14 

1  .  05 

-1 

.  52 

1  .  59 

-0.43 

1  .26 

-1 

.39 

1  .08 

,  1  .46 

1  .  81 

1 

.71 

1  .08 

1.44 

1  .90 

1 

.55 

1  .06 

1  .46 

1  .  56 

2 

.  17 

Notes  : 

•^The  reported  numbers  are  t-statistics  on  pairwise  differences  in 

average  absolute  or  squared  forecast  error.   See  Table  3  for  the 
average  absolute  errors,  and  Table  4  for  the  average  squared 
forecast  errors.   The  computations  are  described  in  equations 
(3.10)  through  (3.15)  in  the  text. 
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^The  forecast  sources  are: 

q.r.w.  -  a  quarterly  random  walk  model  in  fourth  differences; 
equation  (2.4)  in  the  text. 
■ean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst. 
^The  forecast  horizons  are  measured  in  trading  days  prior  to  the 

annual  earnings  announcement. 
■*The  degrees  of  freedom  for  all  reported  t-statistics  are  over 

2,000,  so  they  are  approximately  normal.   For  a  two-sided  test, 
the  .05  and  .01  critical  points  from  the  N(0,1)  distribution  are 
1.96  and  2.58,  respectively. 
^The  dependent  variables,  denoted  (3.1),  (3.2)  and  (3.3),  refer  to 
equation  numbers  in  the  text,  and  are  alternate  definitions  of 
the  forecast  error. 
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Table  8 


Summary  of  Regression  Results: 
Adjusted  R2  and  Regression  F-statisticl  for  Regression 
of  EPS  Forecast  Error  on  Excess  Return2 


Dependent 

Horizon : 

240 

180 

120 

60 

5 

Variable : 

R2 

.  116 

.  095 

.  107 

.  114 

.  156 

Unsealed 

F(k-1 

,  N- 

-k) 

5  .  02 

4.27 

4  .79 

5.09 

4  .60 

error 

k 

199 

199 

199 

199 

195 

[Eqn.  (3.1)] 

N 

5986 

6171 

6267 

6293 

3779 

R2 

.  139 

.  138 

.  129 

.  144 

.  154 

Scaled 

F(k-1 

,N- 

-k) 

5  .  80 

5  .  93 

5  .  61 

6  .  25 

4  .  51 

error 

k 

195 

195 

195 

195 

191 

[Eqn.  (3.2)] 

N 

5783 

5966 

6057 

6083 

3653 

R2 

.  104 

.099 

.097 

.088 

.  138 

%    error 

F(k-1 

.  N- 

-k) 

4  .49 

4  .37 

4  .  35 

4  .  02 

4  .  08 

[Eqn.  (3.3)] 

k 

199 

199 

199 

199 

195 

N 

5928 

6111 

6202 

6228 

3740 

Notes  : 

^The  reported  F-statistics  have  degrees  of  freedom  k-1  and  N-k,  where 
k  is  the  number  of  estimated  parameters,  including  the  intercept, 
and  N  is  the  number  of  observations.   Selected  critical  points 
for  the  F  distribution  are; 

F( 120 , 120)         1 . 77 
F(  120  ,  oo  )         1.45 

^The  estimated  regression  model  is  equation  (3.17)  in  the  text. 
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Table  9 


Sunaary  of  Regression  Results: 
Incremental  F-stat isticsl  for  Groups  of  Parameters, 
in  the  Regression  of  EPS  Forecast  Error  on  Excess  Return2 


Dep  . 
V'ble; 
(3.1) 


year 


f  irn 


excess 
return 


Horizon:  240 

F(kl,N-k)  54.13 

kl  6 

F(k2,N-k)  3.34 

k2  183 

F(k3,N-k)  6.89 

k3  5 


180 


120 


60 


36.79 
6 

25.17 
6 

19.25 
6 

2.94 
6 

3  .  08 
-  183 

3.87 
183 

4.82 
183 

4.65 
183 

7.33 
5 

14.37 
5 

3.57 
5 

2.18 
3 

N-k 


5787 


5972 


6068 


6094 


3584 


(3.2)     year 


F(kl ,N-k) 
kl 


48  .82 
6 


47  .56 
6 


25  .  77 
6 


19  .04 
6 


3.47 
6 


f  i  rm 


F(k2 .N-k) 
k2 


4  .  36 
179 


4  .  54 
179 


4  .  86 
179 


6  .  10 
179 


4.61 
179 


excess 
return 


F(k3,N-k) 
k3 


4.71 
5 


4  .74 
5 


8.  69 

5 


1  .64 

5 


1  .  19 
3 


N-k 


5588 


5771 


5862 


5888 


3462 


(3.3)     year 


F(kl ,N-k) 
kl 


18  .  94 
6 


15  .  9 
6 


18.  21 
6 


15.47 
6 


2.71 
6 


firm 


F(k2 ,N-k) 
k2 


4  .  08 
183 


3  .  98 
183 


3  .  83 
183 


3  .74 
183 


4.19 
183 


excess 
return 


F(k3 .N-k) 
k3 


2  .  51 

5 


3  .  07 
5 


7  .  28 
5 


1  .  72 
5 


0  .  18 
3 


N-k 


5729 


5912 


6003 


6029 


3545 


Notes : 


^The  reported  F-statistics  test  the  incremental  explanatory  power  of 

including  groups  of  parameters  in  the  models.   The  hypotheses  are 
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discussed  in  section  5  of  the  paper.   Selected  critical  points 
for  the  F  distribution  are: 


a    =_^05 

«_=    ^001 

F(       3, 

120) 

2.68 

5.78 

F(        3, 

«    ) 

2  .60 

5.42 

F(        5, 

.  120) 

2.  29 

4.42 

F(        5, 

«     ) 

2.21 

4.10 

F(        6, 

,  120) 

2  .  18 

4  .04 

F(        6, 

«     ) 

2.  10 

3.74 

F( 120, 

,  120  ) 

1  .61 

1  .  77 

F(  120  . 

•     «    ) 

1  .43 

1.54 

^The  estimated  regression  model  is  equation  (3.17)  in  the  text 
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Table  10 


Summary  of  Regression  Results: 
Slope  Coefficients!  for  Five  Forecast  Sources, 2 
in  the  Regression  of  EPS  Forecast  Error  on  Excess  Return3 

( t-statistlcs4  are  in  parentheses) 


Horizon : 

240 

180 

120 

60 

5 

Source: 

Panel  A: 

q  .  a  .  r  . 

0.98 

0.96 

0.99 

0.48 

Unsealed  error 

(5.38) 

(5.72) 

(6.46) 

(3.26) 

[Equation 

(3.3)3 

q. r .w. 

0  .29 

0  .  24 

0.  63 

0.  18 

(1.61) 

(1.44) 

(4.15) 

(1.25) 

vean 

0.14 

0.01 

0.34 

0  .  14 

0.66 

(0.78) 

(0.04) 

(2.20) 

(0.96) 

(1.68) 

median 

0.07 

0  .  07 

0  .41 

0.  20 

0  .77 

(0.40) 

(0.40) 

(2.68) 

(1.38) 

(1.97) 

current 

0.32 

0.29 

0.53 

0.  37 

0.25 

(1.75) 

(1.75) 

(3.49) 

(2.49) 

(0.64) 

Panel  B: 

q  .a.r . 

0.95 

1  .  15 

1.25 

0.55 

Scaled  error 

(3.18) 

(4.68) 

(5  .00) 

(2.25) 

[Equat  ion 

(3.2)] 

q  .  r  .  w  . 

0.48 

0  .33 

0  .  78 

0.  17 

(1.61) 

(1.33) 

(3.11) 

(0 . 68) 

mean 

0.55 

0  .  07 

0  .  45 

0  .  10 

0  .  49 

(1.85) 

(0.29) 

(1 .80) 

(0.40) 

(0.67) 

median 

0.49 

0  .  14 

0  .  53 

0  .  21 

0  .  88 

(1.65) 

(0.55) 

(2.14) 

(0.84) 

(1.21) 

current 

0.73 

0.18 

0  .76 

0  .43 

-0  .  88 

(2.44) 

(0.72) 

(3.04) 

(1.75) 

(-1.21) 
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Table  10  (Continued) 


Panel  C: 
%    error 
[ Equat  ion 
(3.3)  J 


Horizon 
Source : 
q  .  a  .  r  . 


q  .  r  .  w  . 


mean 


med  i  an 


current 


240 

0  .  34 
(2.74) 

0  .  18 

(1.47) 

0.  12 
(0  .  95) 

0  .  11 
(0  .  88) 

0.16 
(1.34) 


180 

0  .  38 
(3.58) 

0  .  15 
(1.42) 

0.05 
(0  .49) 

0  .06 
(0.53) 

0.10 
(0 .96) 


120 

0.  35 
(4.26) 

0  .  27 
(3.23) 

0.  15 

(1  .84) 

0  .  16 
(1.92) 

0  .  22 
(2  .65) 


60 

0  .  15 
(2 .36) 

-0  .  01 
(-0.15) 

0.02 
(0.25) 

0  .03 
(0.47) 

0.11 
(1.83) 


-0  .00 
(-0.01) 

0  .  10 
(0.72) 

0.  02 
(0.14) 


Notes  : 

^Separate  slope  coefficients  are  estimated  for  each  of  the  forecast 
sources.   The  statistical  significance  of  the  correlation 
between  forecast  errors  and  excess  returns  is  measured  by  the 
t-statistic  on  the  slope  coefficient. 
^The  forecast  sources  are: 

q.a.r.  -  a  quarterly  au toregress i ve  model  in  fourth 

differences;  equation  (2.3)  in  the  text, 
q.r.w.  -  a  random  walk  model  with  drift  in  fourth  differences; 
equation  (2.4)  in  the  text, 
mean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst, 
estimated  regression  model  is  equation  (3.17)  in  the  text, 
t-statistics  are  approximately  normal.   For  a  one-sided  test, 
the  .05  and  .01  critical  points  from  the  N(0,1)  distribution  are 
1.65  and  2.33,  respectively. 


^The 
^The 
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Table  11 

PaiTwise  Differences  in  the  Association 
between  Forecast  Errors 
and  Excess  Returns. 1 

Panel  A:   t-Stat istics2  on  Differences  between  Quarterly  Model  (2.3) 

and  Other  Forecast  SourcesS 


Dependent 
Variable  :  4 


(3.1) 


Quarter:  12  3  4 

Horizon:  240        180  120  €0 

q.a.r.  -  q.r.w.  2.69      3.08  1.68  1.47 

q.a.r.  -  mean  3.28      4.08  3.09  1.68 

q.a.r.  -  median  3.55      3.82  2.74  1.37 

q.a.r.  -  current  2.59      2.85  2.16  0.56 


q.a.r.  -  q.r.w.  1.12  2.41  1.37  1.15 

q.a.r.  -  mean  0.95  3.15  2.32  1.35 

(3.2)  q.a.r.  -  median  1.09  2.97  2.08  1.03 
q.a.r.  -  current  0.53  2.85  1.43  0.36 

q.a.r.  -  q.r.w.  0.91  1.55  0.74  1.84 

q.a.r.  -  mean  1.28  2.22  1.75  1.54 

(3.3)  q.a.r.  -  median  1.33  2.19  1.70  1.38 
q.a.r.  -  current  1.00  1.88  1.17  0.39 


Panel  B:   t-Stat i st i cs2  on  Differences  between  the  Most  Current 

and  the  Mean  and  Median  Analyst  Forecasts3 


Dependent 
Variable:4 

Horizon : 

240 

180 

120 

60 

5 

(3.1) 

mean  - 
median  - 

-  current 

-  current 

-0  .  69 
-0  .  96 

-1  .  23 
-0  .  97 

-0  .  94 
-0  .  59 

-1  .  12 
-0  .  82 

0  .  76 
0.97 

(3.2) 

mean  - 
median  - 

-  current 

-  current 

-0  .  42 
-0.56 

-0  .  30 
-0.12 

-0  .  90 
-0.65 

-0.99 
-0.67 

1  .  37 
1  .  77 

(3.3) 

mean  - 
median  - 

-  current 

-  current 

-0  .  28 
-0  .  33 

-0.34 
-0  .  31 

-0  .  58 
-0.52 

-1  .  15 
-0  .  99 

-0.11 
0  .  42 

Notes  : 

^The  estimated  regression  model  is  equation  (3.17)  in  the  text. 
^The  t-statistics  are  approximately  normaJ.   For  a  one-sided  test, 

the  .05  and  .01  critical  points  from  the  N(0,1)  distribution 

are  1.65  and  2.33,  respectively. 
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''The  forecast  sources  are: 

q.a.r.  -  a  quarterly  au t or egres s i ve  model  in  fourth 

differences;  equation  (2.3)  in  the  text, 
q.r.w.  -  a  random  walk  model  with  drift  in  fourth 
differences;  equation  (2.4)  in  the  text, 
mean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst. 
^The  dependent  variables  are  transformations  of  the  forecast  error 
described  in  equations  (3.1),  (3.2)  and  (3.3)  in  the  text. 
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TDOTNOIES 

^Differences  among  analyst  conposite  forecasts  ar«  qualitatively 
siailar  in  a  larger  sample  of  firms,  not  subject  to  the  time- 
series  models'  selection  bias. 

^An  example  of  diversification  across  forecast  sources  to  improve 
accuracy  is  discussed  by  Beaver  (1981),  where  the  forecasts  are 
for  sports  scores. 

^See  Granger  and  Newbold  (1977)  for  a  discussion  and  review  of 
some  evidence,  and  Figlewski  and  Urich  (1982)  for  a  more  recent 
examination.   These  studies  consider  predictions  of  macroeconoTnic 
var  iabies . 

'^Froin  a  sample  of  over  45,000  forecasts  on  457  firms  in  7  years, 
when  a  requirement  was  imposed  tiiat  the  same  analyst  produce 
forecasts  for  a  firm  in  at  least  six  years,  the  sample  size  was 
reduced  to  138  forecasts  on  15  firms.   This  is  fewer  than  two 
forecasts  per  firm-year,  on  average. 

^This  criterion  can  be  criticized  for  overemphasizing  a  cross- 
sectional  outperf ormance ,  which  may  be  anecdotal  given  that  the 
series  are  not  independent.   I  criticize  previous  studies  of 
analysts'  forecasts  on  these  grounds  in  section  3.2. 

®  "Earnings"  and  "earnings  per  share"  (or  "EPS"')  are  sometimes 
used  interchangeably  in  this  paper.   The  data  consist  of 
forecasts  of  EPS,  and  I  wish  to  abstract  from  the  effects  of 
changes  in  capitalization  on  the  prediction  of  earnings.   When  a 
change  in  capitalization  is  announced  between  a  forecast  date  and 
a  realization,  1  adjust  the  forecast  by  the  actual  change  in 
capitalization.   Moreover,  comparisons  across  years  are 
accomplished  by  reducing  all  years  to  a  common  capitalization 
basis  . 

"^The  results  on  predictive  accuracy  and  bias  were  replicated  for 
three  other  scales:   a  variant  of  (3.3)  using  the  standard 
deviation  in  place  of  the  absolute  deviation;  a  variant  of  (3.2) 
where  the  sample  was  censored  to  exclude  negative  denominators; 
and  a  variant  of  (3.2)  where  the  sample  was  censored  to  exclude 
denominators  smaller  than  $.20.   The  results  presented  in  this 
paper  are  representative  of  the  larger  set. 

^This  discussion  does  not  argue  against  the  existence  of  firm- 
specific  effects  which  persist  through  time.   It  argues  against 
an  explanation  for  such  effects  based  on  unanticipated 
information  . 

^The  subscript  i.  which  indexes  the  source  of  the  forecast,  is 
suppressed  in  the  following  discussion  for  readability. 

l^The  estimation  was  also  done  with  f i rm- spec i f i c  effects  in  the 
model.   The  bias  results  do  not  differ  from  those  reported  here. 


11 


Th 


coef 
ef  f  e 
If  t 
(3.1 
var  i 
DF2) 
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e  estimated  effects  are  not  the  same  as  the  estimated 
ficients,  because  models  (3.10)  and  (3.11)  have  two  sets  of 
cts,  firms  and  years.  A  simple  example  will  Illustrate  this, 
here  were  two  years  and  three  firms  in  the  sample,  then  model 
0)  could  be  estimated  with  no  intercept,  using  two  year  dummy 
ables  (DYl  and  DY2 )  and  two  firm  dummy  variables  (DFl  and 


I  e  .,  I  =  d, ,  DFl 
'  J  tx '     llT 


cl,3^DF2 


+  d^,  DYl 
2lT 


d32,DV2 


jtx 


The  "year  1  effect"  is  the  average  |ejt^|  for  t=l.   This  effect 
is  not  estimated  by  d2ix.   Rather,  d2iT  is 'the  average  lejtxl  for 
year  1  for  the  omitted  (third)  firm.   The  "year  1  effect"  is 
estimated  in  this  formulation  by: 


The 


The 


21x     21x 

"firm  1  effect" 


+  (1/3)  djj^  +  (1/3) 

is  estimated  by: 

+  (1/2)  d^^^  +  (1/2) 


llx     llx 
"firm  3  effect"  is  estimated  by 


12x 


22x 


13x 


(1/2)  d^^^  .  (1/2)  d^^^ 


Sine 
used 
es  t  i 


e  any  non-redundant  spanning  set  of  dummy  variables  can  be 
,  the  particular  linear  combinations  of  coefficients  to 
mate  the  firm  and  year  effects  depend  on  the  model  used. 


l^Dropping  the  ct-nj-^,     the  firm-specific  effects,  does  not  alter 
the  estimates  of  the  slope  coefficients  Bj-^  or  their  statistical 
significance  by  a  substantial  amount.   Omitting  the  a2itx> 
however,  alters  both  the  estimates  and  their  statistical 
significance. 

^^An  alternative  way  to  model  this  problem  is  to  include  the  year 
and  firm  effects  as  "random  effects",  contributing  off-diagonal 
elements  to  the  covariance  matrix  of  the  Vjj^.^.   Mundlak  (1978) 
shows  that  the  estimate  of  the  slope  coefficient  Bi-^  obtained 
from  a  model  like  (3.17),  is  identical  to  the  GLS  estimate  which 
would  be  obtained  if  the  firm-  and  year- spec  i  f  ic  effects,  O-nji 
and  ot2itx.  were  modeled  as  random  veffects  and  included  in  the 
covariance  matrix. 


l^Brown,  Foster  and  Noreen  (1984)  analyze  a  related,  but 
different,  dataset,  taken  from  the  set  of  summary  statistics 
which  I/B/E/S  produces  for  clients. 

^^Firms  announce  earnings  with  remarkable  consistency  year  after 
year,  so  choosing  fixed  lengths  of  time  prior  to  the  announcement 
date  is  a  fairly  accurate  means  of  finding  dates  that  differ  by 
one  quarter's  earnings.   Of  the  6218  horizons  included  in  this 
paper,  13  dates  did  not  fall  between  the  quarterly  announcements 
as  intended.   The  results  are  not  affected  by  deletion  of  these 
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observations 


^^Results  did  not  differ  fro*  those  reported  here  when  the  value- 
weighted  aarket  portfolio  of  securities  was  used  as  a  proxy  for 
the  market. 

•^"^For  example,  see  Dirks  and  Gross,  The  Great  Wall  Street 
Scandal,  especially  pp.  252-257.   Also  see  "Bank  Analysts  Try  to 
Balance  their  Ratings",  WSJ,  May  29,  1984,  p.  33;  and   "Picking  a 
Loser",  WSJ.  September  28.  1983,  p.  1. 

^^An  alternate  method  of  measuring  the  association  between 
forecast  errors  and  excess  returns,  similar  to  that  of  Ball  and 
Brown  (1968),  constructs  portfolios  based  on  foreknowledge  of  the 
sign  of  EPS  forecast  error.   That  is,  a  long  position  is  taken  in 
each  of  the  securities  for  which  the  forecast  error  is  positive, 
and  a  short  position  is  taken  in  those  with  negative  errors. 
This  procedure,  applied  to  these  data,  produces  results 
qualitatively  identical  to  those  reported  here. 
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